Quaero@H1: An Interface to High-pT HERA Event Data

Distributions from high-pT HERA event data analyzed in a general search for new physics at H1
have been incorporated into Quaero, an algorithm designed to automate tests of specific hypotheses
with high energy collider data. The use of Quaero@H1 to search for leptoquarks, R-parity violating
supersymmetry, and excited quarks provides examples to develop intuition for the algorithm's
performance.

I. INTRODUCTION

The publication of most searches for new physics takes 
the form of an exclusion contour in a two-dimensional 
model parameter space, with alternative interpretations 
typically requiring a re-analysis of the data. Publishing 
the data themselves (either in raw form or as 4-vectors) 
is unsatisfactory because substantial expert knowledge 
is required for an accurate analysis. This article contin- 
ues a recently introduced paradigm of publishing frontier 
energy collider data via an interface that can be used to 
quickly test any specific hypothesis against collider event 
data, with the analysis performed by an algorithm that 
encapsulates expert knowledge of the experiment. In this 
way exclusion contours (or discovery regions) can be produced
on demand. This type of interface can be used with
well understood data to test models motivated either by
new theoretical insights or by the consideration of subsequently
collected data. Such an interface can also be
used with freshly collected data in the exploratory pro- 
cess of fitting together an underlying physical interpreta- 
tion for observed discrepancies while exploring in parallel 
more mundane experimental explanations. 

The H1 detector has recorded collisions of electrons
and protons at center of mass energies of 301 and
319 GeV in runs of the HERA collider between 1992 and
2000. These data have been used to understand a wide
spectrum of physics, from the internal structure of the
proton to tests of exotic physics at the electroweak scale.
The analysis of HERA-I data has resulted in over one
hundred publications in internationally recognized journals,
and a general analysis of all high-pT final states has
recently been performed in the H1 General Search [1].

This article describes the interpretation of the distributions
published by the H1 Collaboration in a general
search for new physics using the Quaero framework.
Quaero was used previously by the D0 collaboration to
automate the optimization of searches for new physics in
the Tevatron Run I data [2]. The H1 General Search is
reviewed briefly in Sec. II, with Sec. III covering Quaero's
knowledge of the H1 detector response. Section IV briefly
reviews the Quaero algorithm. Section V contains the
results of several analyses that have been performed using
Quaero@H1, allowing a comparison to previous results.
A summary is given in Sec. VI.



II. GENERAL SEARCH 

The H1 General Search has been published in Ref. [1].
This search, briefly described below, investigates events
with high-pT objects (electrons, muons, jets, photons,
and the presence of missing transverse energy) produced
in ep collisions at HERA. The histograms published by
H1 (the invariant masses and the sums of the transverse
momenta for high-pT events) are used as input to the
Quaero algorithm in the studies described in this paper.
This section briefly reviews the elements of this general
search that have been incorporated into Quaero@H1.

A detailed description of the H1 detector can be found
in Refs. [3, 4]. The main trigger for events with high
transverse momentum is provided by the liquid argon
calorimeter. The trigger efficiency is close to 100% for
events containing an electron or photon with transverse
momentum greater than 20 GeV, 90% for events containing
one or more jets with $p_T > 20$ GeV or with missing
transverse momentum ($\not\!p_T$) greater than 20 GeV, and
about 70% for di-muon events [1].

The H1 data available within Quaero correspond to

• 36.4 pb$^{-1}$ of 27.5 GeV positrons on 820 GeV protons,
at a center of mass energy of 301 GeV;

• 13.8 pb$^{-1}$ of 27.5 GeV electrons on 920 GeV protons,
at a center of mass energy of 319 GeV; and

• 66.4 pb$^{-1}$ of 27.5 GeV positrons on 920 GeV protons,
at a center of mass energy of 319 GeV.

Standard object identification criteria are used to define
electrons ($e$), muons ($\mu$), photons ($\gamma$), and jets ($j$)
[1]. All identified objects are required to have $p_T > 20$ GeV
and $10^\circ < \theta < 140^\circ$. The charge of leptons
is not distinguished; all leptons are taken to have positive
charge. No attempt is made to identify jets containing
heavy flavor. Events containing fewer than two of
these objects are discarded. All objects are required to
be isolated by a minimum distance of one unit in the $\eta$-$\phi$
plane. A neutrino object is defined for missing transverse
momentum above 20 GeV.

The experimental sources of systematic error affecting
the modeling of these data are identical to those considered
in Ref. [..]. Although Quaero allows a full specification
of correlated uncertainties, all H1 sources of systematic
error are treated as uncorrelated.

Several Monte Carlo event generators are combined to 
estimate dominant Standard Model processes [1]. These 
generated events serve as the reference model to which 
hypotheses presented to Quaero are compared. Here 
and below "Standard Model," "background," and "refer- 
ence model" are used interchangeably. 

III. TURBOSIM@H1

To keep Quaero fast and standalone, a fast detector
simulation algorithm (TurboSim@H1) is built in accordance
with the H1 detector simulation. It is based on
a large lookup table of half a million lines mapping
particle-level objects to objects reconstructed in the detector.
Sample lines in this table are shown in Fig. 1.
The total table is roughly 100 MB, and as such can be
read into memory and searched as a multivariate binary
tree. The resulting simulation runs at roughly 10 ms per
event.
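The lookup idea can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the five table rows, the distance metric, and the function names are hypothetical, and a linear scan stands in for the multivariate binary tree used by TurboSim@H1.

```python
import math

# Hypothetical, heavily simplified stand-in for the lookup table: each row maps
# one particle-level object (type, pT in GeV, eta) to reconstructed objects.
TABLE = [
    (("e+", 25.0, 0.5), [("e+", 24.6, 0.5)]),    # correctly identified positron
    (("e+", 60.0, 1.0), [("j", 57.0, 1.0)]),     # positron misidentified as a jet
    (("j", 30.0, -0.2), [("j", 27.5, -0.2)]),    # well reconstructed jet
    (("j", 22.0, 1.5), []),                      # jet lost: models an inefficiency
    (("mu+", 40.0, 0.0), [("mu+", 41.0, 0.0)]),  # reconstructed muon
]

def distance(key, obj):
    """Assumed metric: log ratio of pT combined with the difference in eta."""
    return math.hypot(math.log(key[1] / obj[1]), key[2] - obj[2])

def simulate_object(obj):
    """Map one particle-level object to reconstructed objects via the nearest row."""
    rows = [row for row in TABLE if row[0][0] == obj[0]]
    if not rows:
        return []  # no table entry of this type: nothing is reconstructed
    key, reco = min(rows, key=lambda row: distance(row[0], obj))
    scale = obj[1] / key[1]   # carry the row's pT response over to the query
    shift = obj[2] - key[2]   # keep the query's own direction
    return [(kind, pt * scale, eta + shift) for kind, pt, eta in reco]

reco_positron = simulate_object(("e+", 26.0, 0.4))  # nearest row: 25 GeV positron
reco_soft_jet = simulate_object(("j", 21.0, 1.4))   # nearest row: the lost jet
```

A tree-based search would replace the `min` over all rows, which is what makes a table of half a million lines searchable in milliseconds.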

Particle efficiencies are handled through lines in the
TurboSim@H1 table that map a parton-level object to
no reconstructed-level object. Misidentification probabilities
are handled through lines that map a particle-level
object to a reconstructed-level object of a different type.
The merging and overlap of particles is handled by configurations
in the table that map two or three particle-level
objects to zero or more reconstructed-level objects.

Validation of TurboSim@H1 has been performed by
running an independent sample of one million events
through both the H1 full simulation and TurboSim@H1.
The event classification and the kinematic distributions
of the events from the two simulation chains are compared
using a Kolmogorov-Smirnov (KS) test. TurboSim@H1
has been found to agree with the full
simulation of H1.
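A two-sample KS comparison of this kind can be sketched as follows. The function names are hypothetical, the p-value uses the standard asymptotic Kolmogorov series with a common small-sample correction, and nothing here is taken from the validation code actually used for TurboSim@H1.

```python
import math

def ks_statistic(a, b):
    """Maximum distance between the empirical CDFs of samples a and b."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        while i < len(a) and a[i] <= x:   # step past all values equal to x
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d

def ks_pvalue(d, n, m):
    """Asymptotic two-sample p-value from the Kolmogorov distribution."""
    en = math.sqrt(n * m / (n + m))
    lam = (en + 0.12 + 0.11 / en) * d
    if lam < 0.2:
        return 1.0  # the series converges slowly here and the p-value is ~1
    total = sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
                for k in range(1, 101))
    return max(0.0, min(1.0, 2.0 * total))

d_same = ks_statistic([1, 2, 3, 4], [1, 2, 3, 4])   # identical samples
d_apart = ks_statistic([1, 2, 3], [4, 5, 6])        # fully separated samples
```

In practice a library routine such as `scipy.stats.ks_2samp` could be used instead of a hand-written statistic.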



IV. QUAERO 

Quaero provides a convenient interface to the under- 
standing represented by high energy collider data, back- 
grounds, and detector response. This interface is de- 
signed to facilitate the test of any specific hypothesis 
against such data. The Quaero web page, shown in 
Fig. 2, is available online at Ref. [5]. 

A physicist wishing to test her hypothesis against H1
data will provide her hypothesis in the form of commands
to one of the built-in event generators [6, 7, 8]. Quaero
uses the specified event generator to generate signal
events corresponding to $e^+p$ collisions at 301 GeV, $e^-p$
collisions at 319 GeV, and $e^+p$ collisions at 319 GeV. The
response of the H1 detector to these events is simulated
using TurboSim@H1. The output of TurboSim@H1 is
a text file containing simulated events, normalized such
that the total number of expected signal events produced
at H1 is equal to the sum of all event weights in the file.
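The normalization can be made concrete with a short sketch. The luminosities are those quoted in Sec. II; the signal cross sections, the number of generated events, and the function name are hypothetical values chosen only for illustration, and losses in the detector simulation are ignored.

```python
# Integrated luminosities (pb^-1) of the three H1 data sets quoted in Sec. II.
LUMINOSITY_PB = {
    "e+p 301 GeV": 36.4,
    "e-p 319 GeV": 13.8,
    "e+p 319 GeV": 66.4,
}

def event_weight(cross_section_pb, luminosity_pb, n_generated):
    """Weight per generated event so that the weights sum to sigma * L."""
    return cross_section_pb * luminosity_pb / n_generated

# Hypothetical signal cross sections (pb) for each running condition.
sigma_pb = {"e+p 301 GeV": 0.08, "e-p 319 GeV": 0.05, "e+p 319 GeV": 0.10}
n_generated = 10000  # assumed number of generated events per running condition

weights = {run: event_weight(sigma_pb[run], LUMINOSITY_PB[run], n_generated)
           for run in sigma_pb}

# Summing all event weights recovers the expected number of signal events.
expected_events = sum(w * n_generated for w in weights.values())
```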

Three distinct samples of events exist at this point:
the data $\mathcal{D}$; the Standard Model prediction SM; and the
hypothesis $\mathcal{H}$, which is the sum of included Standard
Model processes and the physicist's signal. Each sample
of events is partitioned into exclusive final states, categorized
by reconstructed objects with $p_T > 20$ GeV. In each
exclusive final state, a pre-defined list of two variables,
the summed scalar transverse momentum ($\sum p_T$) and the
invariant mass of all objects ($m_{\rm all}$), are ranked according
to the difference between the Standard Model prediction
and the physicist's hypothesis $\mathcal{H}$. The variable
showing the most difference is used [19].
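The ranking step can be sketched as below. The separation measure (half the summed absolute difference of unit-normalized histograms), the bin edges, the toy events, and all function names are assumptions for illustration; the text does not specify which measure of difference Quaero uses.

```python
def histogram(values, weights, edges):
    """Weighted histogram with bins [edges[k], edges[k+1])."""
    counts = [0.0] * (len(edges) - 1)
    for v, w in zip(values, weights):
        for k in range(len(counts)):
            if edges[k] <= v < edges[k + 1]:
                counts[k] += w
                break
    return counts

def separation(sm, hyp):
    """Assumed measure: half the L1 distance between normalized histograms."""
    s, h = sum(sm), sum(hyp)
    return 0.5 * sum(abs(a / s - b / h) for a, b in zip(sm, hyp))

def pick_variable(sm_events, hyp_events, binning):
    """Return the variable with the largest SM-vs-hypothesis separation."""
    scores = {}
    for name, edges in binning.items():
        sm = histogram([e[name] for e in sm_events],
                       [e["w"] for e in sm_events], edges)
        hyp = histogram([e[name] for e in hyp_events],
                        [e["w"] for e in hyp_events], edges)
        scores[name] = separation(sm, hyp)
    return max(scores, key=scores.get), scores

# Toy events in one final state: the hypothesis is SM plus a high-sumPt signal.
sm_events = [{"sumPt": s, "mAll": m, "w": 1.0}
             for s, m in [(50, 100), (60, 110), (70, 120), (80, 130)]]
signal = [{"sumPt": 150, "mAll": 115, "w": 1.0},
          {"sumPt": 160, "mAll": 125, "w": 1.0}]
hyp_events = sm_events + signal
binning = {"sumPt": [40, 110, 200], "mAll": [90, 120, 150]}

best, scores = pick_variable(sm_events, hyp_events, binning)
```

In this toy case the signal populates a region of sumPt that the Standard Model does not, while leaving the shape in mAll unchanged, so sumPt is selected.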

In this variable space, densities are estimated from the
Monte Carlo events predicted by SM and $\mathcal{H}$. These
densities are used to define a discriminant, which is
binned to distinguish SM from $\mathcal{H}$. The likelihood ratio
$\mathcal{L} = p(\mathcal{D}|\mathcal{H}) / p(\mathcal{D}|\mathrm{SM})$ is determined using this binning,
and systematic errors are integrated numerically.
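For a fixed binning, the likelihood ratio can be sketched under the assumption of independent Poisson-distributed bin counts. The function name and the toy bin contents are hypothetical, and the numerical integration over systematic errors is deliberately left out.

```python
import math

def log10_likelihood_ratio(data, hyp, sm):
    """log10 [ p(D|H) / p(D|SM) ] for independent Poisson-distributed bins."""
    ln_ratio = 0.0
    for n, h, b in zip(data, hyp, sm):
        # ln Poisson(n|h) - ln Poisson(n|b); the n! terms cancel in the ratio.
        ln_ratio += n * math.log(h / b) - (h - b)
    return ln_ratio / math.log(10.0)

sm_pred = [10.0, 1.0]    # Standard Model expectation per discriminant bin
hyp_pred = [10.0, 4.0]   # hypothesis adds three events to the signal-rich bin

sm_like = log10_likelihood_ratio([10, 1], hyp_pred, sm_pred)      # data look like SM
signal_like = log10_likelihood_ratio([10, 4], hyp_pred, sm_pred)  # data show an excess
```

A negative value means the data favor the Standard Model; a positive value means they favor the hypothesis.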

The result returned by Quaero is the decimal logarithm
of this likelihood ratio. The measurement of
model parameters using Quaero is easily accomplished
by graphing $\log_{10}\mathcal{L}$ as a function of varied parameter values,
with multiple Quaero submissions. Distributions of
the Standard Model prediction SM, the prediction of the
physicist's hypothesis $\mathcal{H}$, and the H1 data $\mathcal{D}$ in the most
relevant variable in each of the most relevant final states
are returned to the querying physicist in an email along
with her result.

Further details of the Quaero algorithm are provided
in Ref. [9].

FIG. 1: Ten sample lines in the TurboSim@H1 lookup table, chosen to illustrate TurboSim's handling of interesting cases.
Each line begins with the event's type, run and event number, and vertex position. To the left of the arrow ("->") is a list of
nearby parton-level objects; to the right of the arrow is a list of corresponding reconstructed-level objects. The first line shows
a positron correctly identified as a positron, while the second line shows a positron that has not been correctly identified. The
third line shows a nicely reconstructed jet; in the fourth line the jet has been split into two; in the fifth line the jet is either
not reconstructed or reconstructed with $p_T < 20$ GeV. The sixth line shows a jet identified as a muon. The seventh line shows
a nearby positron and muon that have been reconstructed as one jet; the eighth line shows a positron with a photon radiated
sufficiently nearby that the two are agglomerated into a single reconstructed-level positron. The ninth and tenth lines show
muons that are missed and reconstructed, respectively.

FIG. 2: The Quaero interface, designed for HERA I, LEP 2,
Tevatron II, and the future LHC. A new hypothesis can be
provided as commands to one of several event generators.
The result returned quantifies the extent to which the data
(dis)favors the new hypothesis relative to the Standard Model.

V. EXAMPLES

Quaero has been used to test models that have previously
been considered at H1, in order to benchmark
Quaero's sensitivity, and to test models that have not
yet been considered at H1, in order to see how Quaero
performs on novel searches for new physics. These examples
provide intuition for the Quaero algorithm: its
strengths, and its limitations.

A rough, non-rigorous, but nonetheless useful comparison
of the sensitivity of Quaero's results (which take
the form of the decimal logarithm of a likelihood ratio)
with previous analyses (which typically take the form of
95% confidence level exclusion limits) can be made by
comparing $\log_{10}\mathcal{L} = -1$ with the 95% confidence level
exclusion limit [20]. The decimal logarithm of the likelihood
returned by Quaero can be converted into exclusion
limits, measurements with errors, or (potentially)
statements of discovery, perhaps with multiple Quaero
submissions.

The examples considered in this section include a
search for scalar leptoquarks, R-parity violating supersymmetry,
and an excited quark.

A. Leptoquarks

Quaero@H1 is first used to search for leptoquarks,
particles possessing both lepton and baryon quantum
numbers that arise naturally in Grand Unified Theories.
Attention is restricted to a scalar leptoquark coupling to
a positron and an up quark. The coupling $\lambda$ of the LQ-e-u
vertex and the leptoquark mass $m_{LQ}$ are allowed to
vary. The interaction Lagrangian is assumed to be of the
form


$$\mathcal{L} = \lambda \, LQ \, \bar{u}_R \, e_L + \mathrm{h.c.} + g_s \left( LQ^{*} \overleftrightarrow{\partial_\mu} LQ \right) G^{\mu}, \qquad (1)$$

where LQ is a scalar leptoquark field; $\bar{u}_R$ and $e_L$ represent
a right-handed anti-up quark and left-handed electron;
$G^{\mu}$ is the gluon field; and $g_s = \sqrt{4\pi\alpha_s} \approx 1.2$. The
MadEvent [8] input to Quaero corresponding to these
Lagrangian terms is shown in Fig. 3.

###############################################
Name      Anti   Spin   Mass   Width   Color
xxxx      xxxx   sfv    GeV    GeV     sto
PARTICLE  lq     lq~    s      301     0.54    t
###############################################
Interacting particles        Coupling
INTERACTION   u    e-    lq     0.3
INTERACTION   e-   u     lq~    0.3
INTERACTION   g    lq~   lq     1.22
###############################################



FIG. 3: MadEvent input given to Quaero to specify the Lagrangian
terms in Eq. 1. The line beginning with PARTICLE introduces
into the theory a color triplet scalar leptoquark with
mass 301 GeV. The three lines beginning with INTERACTION
introduce new vertices and specify the coupling strength at
each vertex.

The likelihood ratio Quaero returns should be interpreted
as an update of betting odds for or against
the provided hypothesis. For the hypothesis of a scalar
leptoquark with mass $m_{LQ} = 301$ GeV, Quaero finds
$\log_{10}\mathcal{L} = -1.5$. If betting odds on this hypothesis were
(say) 100:1 against before looking at these data, then
these data indicate those odds should be adjusted by
an extra factor of $10^{1.5} \approx 32$:1 against. Betting odds
against this hypothesis after having run this request are
now 3200:1.
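The arithmetic of this update is simple enough to write out. The function name is hypothetical; the prior odds and values of $\log_{10}\mathcal{L}$ are the ones quoted in this section.

```python
def update_odds_against(prior_odds_against, log10_L):
    """Posterior odds against H: prior odds against divided by the likelihood ratio."""
    return prior_odds_against / (10.0 ** log10_L)

# Leptoquark example: 100:1 against before the data, log10 L = -1.5.
leptoquark_odds = update_odds_against(100.0, -1.5)   # about 3200:1 against

# R-parity violating example of Sec. V B: 10^7:1 against, log10 L = 2.5.
rpv_odds = update_odds_against(1.0e7, 2.5)           # about 30000:1 against
```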

Quaero also returns a link to plots showing how the 
result has been obtained. The cover page to these dis- 
tributions contains the requester's name, email address, 
and the Quaero request number, together with the con- 
tributions of each experiment to the final Quaero re- 
sult. One plot is returned for each of the final states 
contributing greater than 0.1 to the decimal logarithm of 
the likelihood ratio. 

The result of Quaero's search for leptoquarks with
interactions specified by the Lagrangian of Eq. 1 with $\lambda = 0.3$
and with mass up to 302 GeV is shown in Fig. 4. In all
cases Quaero finds $\log_{10}\mathcal{L} < 0$, indicating the H1 data
favor the Standard Model over the provided hypotheses.

In addition to varying the assumed leptoquark mass,
the coupling strength $\lambda$, the signal production cross section,
or an overall k-factor can be separately specified.
An algorithm for turning this Quaero request into a
standard 95% confidence level exclusion limit involves
submitting several requests assuming a range of leptoquark
masses and a range of different cross sections at
each mass, and computing the 95% confidence level cross
section limit at each mass assuming a prior distribution
for that cross section. Masses for which the 95% confidence
level cross section limit is smaller than the theoretical
cross section are said to be excluded at 95% confidence.
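The limit-setting recipe at a single mass can be sketched as follows, assuming a flat prior on the cross section and a scan of $\log_{10}\mathcal{L}$ values that would come from repeated Quaero submissions. The function names, the exponential toy scan, and the theoretical cross sections are hypothetical.

```python
import math

def upper_limit_95(sigmas, log10_L, cl=0.95):
    """Cross section below which a fraction `cl` of the posterior lies."""
    # Flat prior: the posterior density is proportional to the likelihood ratio.
    post = [10.0 ** v for v in log10_L]
    cum = [0.0]  # cumulative trapezoidal integral of the posterior
    for k in range(1, len(sigmas)):
        cum.append(cum[-1] + 0.5 * (post[k] + post[k - 1]) * (sigmas[k] - sigmas[k - 1]))
    target = cl * cum[-1]
    for k in range(1, len(sigmas)):
        if cum[k] >= target:
            frac = (target - cum[k - 1]) / (cum[k] - cum[k - 1])
            return sigmas[k - 1] + frac * (sigmas[k] - sigmas[k - 1])
    return sigmas[-1]

def is_excluded(limit_pb, theory_pb):
    """A mass point is excluded if the limit lies below the predicted cross section."""
    return limit_pb < theory_pb

# Toy scan: L(sigma) = exp(-sigma / TAU), sampled on a fine grid of cross sections.
TAU = 1.0
sigmas = [0.01 * k for k in range(2001)]
scan = [-(s / TAU) / math.log(10.0) for s in sigmas]
limit = upper_limit_95(sigmas, scan)   # close to 3 * TAU for this toy scan
```

Repeating this at each assumed mass and comparing `limit` with the theoretical cross section yields the excluded mass range.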

Intuitively, the 95% confidence exclusion region corresponds
to a region where the signal hypothesis is moderately
disfavored by the data. If the threshold is chosen to
be 10:1 against ($\log_{10}\mathcal{L} = -1$), then leptoquarks of mass
less than 301 GeV are excluded. This result is in accord
with the result of a previous analysis of this signal, described
in Ref. [10], which derives a comparable limit for
leptoquarks of this type.

The subset of D0 Run I data made available in the
first implementation of Quaero [2] has been incorporated
into the current version of Quaero. Plots of
Quaero's result in the parameter plane of $\lambda$ and $m_{LQ}$
using the H1 and D0 data separately are shown in Fig. 5.
Quaero's result using H1 and D0 data combined is
shown in Fig. 6. Quaero is able to make use of the
Tevatron's $\lambda$-independent exclusion of leptoquarks with
low mass and HERA's $\lambda$-dependent exclusion at higher
masses to rule out more of the parameter space than either
collider is able to on its own.

FIG. 4: Quaero's output ($\log_{10}\mathcal{L}$) for the interaction Lagrangian
of Eq. 1 with $\lambda = 0.3$ as a function of assumed
leptoquark mass ($m_{LQ}$), shown as the solid line through filled
circles. The evidence Quaero expects to provide in favor of
the Standard Model SM is shown as the (dark, red) dashed
curve with $\log_{10}\mathcal{L} < 0$; the evidence Quaero expects to provide
in favor of the hypothesis $\mathcal{H}$ is shown as the (light, green)
dashed curve with $\log_{10}\mathcal{L} > 0$. The result of a previous analysis
excludes leptoquark masses $m_{LQ} < 300$ GeV at a confidence
level of 95%, indicated by the vertical (gray) dashed
line.

FIG. 5: Quaero's log likelihood ratio as a function of the
coupling $\lambda$ and leptoquark mass $m_{LQ}$, in the scenario defined
by the additional interaction terms of Eq. 1. Shown separately
are results from Quaero using data from D0 Tevatron Run
I (left) and using data from H1 HERA Run I (right). All
shaded area corresponds to $\log_{10}\mathcal{L} < 0$.

FIG. 6: Quaero's log likelihood ratio as a function of the
coupling $\lambda$ and leptoquark mass $m_{LQ}$, in the scenario defined
by the additional interaction terms of Eq. 1, combining data
from D0 Tevatron Run I and H1 HERA Run I. All shaded
area corresponds to $\log_{10}\mathcal{L} < 0$.

FIG. 7: A Feynman diagram generated by MadGraph [11]
given the additional particles and interactions specified in
Fig. 8. Here $\tilde{u}$ is an up squark, chi is a chargino, and lls
is a long-lived scalar, in this case a muon sneutrino.

###############################################
Name      Anti   Spin   Mass   Width   Color
xxxx      xxxx   sfv    GeV    GeV     sto
PARTICLE  usq    usq~   s      200     1       t
PARTICLE  chi-   chi+   f      150     1       s
###############################################
Interacting particles        Coupling
INTERACTION   d      e-     usq    0.01
INTERACTION   chi-   d      usq~   0.05
INTERACTION   mu-    chi-   lls    0.10
[three further INTERACTION lines are only partially legible in the source; surviving fragments: e-, d, chi-, lls, 0.05, 0.10]
###############################################
PARAMETER long_lived_mass = 50
###############################################

FIG. 8: MadEvent input given to Quaero to specify a signal
corresponding to the new particles and interactions that must
be introduced in order for a diagram such as that shown in
Fig. 7 to occur. The two lines beginning with PARTICLE introduce
into the theory a u squark with spin 0, mass 200 GeV,
width 1 GeV, and a color triplet; and a $\chi$ with spin 1/2,
mass 150 GeV, width 1 GeV, and a color singlet. The six
lines beginning with INTERACTION introduce new vertices and
specify the coupling strength at each vertex. The particle lls
is a "long-lived scalar" that is implicitly added to the particle
content of the theory; a long-lived fermion llf and a long-lived
vector llv are similarly available. The line beginning
with PARAMETER specifies the mass of the lls.



B. R-parity violating supersymmetry

An R-parity violating supersymmetry scenario is considered
as a possible explanation for the $\mu j\nu$ events whose
interestingness is quantified in Ref. [1]. R-parity violating
supersymmetry can lead to squark production at HERA.
If the muon sneutrino happens to be the lightest supersymmetric
particle, then diagrams such as that shown in
Fig. 7 can give rise to the final state $\mu j\nu$. This represents
a scenario not yet tested by either of the HERA
experiments.

Most new physics scenarios are specified "top-down," 
starting with the general Lagrangian and then restricting 
the resulting phenomenology through a judicious choice 
of values of the model parameters. In this subsection a 
"bottom-up" approach is adopted to illustrate another 
way in which Quaero may be used to understand fea- 
tures of the data in terms of the underlying physical the- 
ory, in the spirit of Bard [12]. A new physics process 
is guessed, together with required masses and couplings, 
and rigorously tested using Quaero. Iteration of this 
procedure allows a fit of parameters. 

Ignoring the full sparticle spectrum, the new particles
that must be postulated to allow the diagram shown in
Fig. 7 are a scalar color triplet ($\tilde{u}$), an electrically charged
fermion color singlet ($\chi^+$), and a stable scalar without
strong or electromagnetic interactions ($\tilde{\nu}_\mu$). Three new
interactions must also be postulated, corresponding to
the three vertices of Fig. 7.

FIG. 9: Plots illustrating Quaero's testing of the R-parity
violating supersymmetry scenario described in the text, with
the muon sneutrino playing the role of the lightest supersymmetric
particle. The contribution of the postulated signal
is seen (upper left) by comparing the distributions predicted
by the hypothesis $\mathcal{H}$ (light, green) to the Standard Model
SM (dark, red) in the final state $j\mu^+$ in Quaero's chosen
variable sumPt ($\sum p_T$). H1 data are shown as (black) filled
circles. The upper right figure shows the density estimates
constructed by Quaero in this variable. Bins in the resulting
discriminant are shown in the lower two figures. Shading
in the lower two plots indicates the bin $D < 0.54$ corresponds
to $40 < \sum p_T < 110$, and the bin $D > 0.54$ corresponds to the
signal-rich region $110 < \sum p_T$. The vertical axis in the lower
right plot is without meaning, intended simply to give "thickness"
to the shaded regions in sumPt. The number (black)
at upper right is the log likelihood ratio $\log_{10}\mathcal{L}$ that would
be found by Quaero if that final state were analyzed alone,
without integration over systematic errors; the number (gray)
just below this is the log likelihood ratio expected if the data
are drawn from the Standard Model distribution. Systematic
errors are included in the final result returned by Quaero.

These new particles and interactions can be provided
to Quaero as MadEvent input, shown in Fig. 8, without
specifying the complete renormalizable theory.

FIG. 10: Quaero's log likelihood ratio (solid curve) as a
function of assumed $m_\chi$. Also shown is Quaero's expected
evidence (dashed curves) as a function of this parameter, if
the data are drawn from the hypothesis (light, green) or the
Standard Model (dark, red). The hypothesis provides a better
fit to the data than the Standard Model for all values of $m_\chi$
in the range shown, corresponding to an update of betting
odds of $\approx 300$:1 in favor.

Taking the widths of each new particle to be small
compared to experimental resolution, this scenario contains
six new parameters: three new particle masses and
three new coupling strengths. The kinematics of the $\mu j\nu$
events suggest $m_{\tilde{u}} \sim 200$ GeV and $m_{\tilde{\nu}} < 50$ GeV, while
bounds from LEP 1 indicate $m_{\tilde{\nu}} > M_Z/2$, leading to
the choice of $m_{\tilde{u}} = 200$ GeV and $m_{\tilde{\nu}} = 50$ GeV for
this example. The number of events observed by H1 in
the $\mu j\nu$ final state suggests a signal production cross section
of roughly 0.1 pb. Considering the single diagram of
Fig. 7, the three new couplings affect the phenomenology
of the postulated signal only by multiplying the overall
cross section by the square of their product. The $\chi$-$d$-$\tilde{u}$
and $\mu$-$\chi$-$\tilde{\nu}_\mu$ couplings are of electroweak strength; the
R-parity violating coupling $d$-$e$-$\tilde{u}$ is chosen so that the
cross section of the process shown in Fig. 7 is 0.1 pb.
The remaining parameter $m_\chi$ is allowed to vary between
75 GeV and 175 GeV.
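Because the cross section scales with the square of the product of couplings, tuning one coupling to reach a target cross section is a one-line rescaling. In the sketch below the reference cross section is a hypothetical number standing in for a generator run at a reference coupling, and the function name is an assumption.

```python
import math

def rescale_coupling(g_ref, sigma_ref_pb, sigma_target_pb):
    """Coupling giving sigma_target when sigma is proportional to g**2."""
    return g_ref * math.sqrt(sigma_target_pb / sigma_ref_pb)

# Hypothetical reference point: coupling 0.01 yields 0.025 pb in a generator run.
g_rpv = rescale_coupling(0.01, 0.025, 0.1)   # coupling needed for 0.1 pb
```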

Figure 9 shows Quaero's analysis of this hypothesis
with the choice $m_\chi = 150$ GeV. As expected, the final
state $j\mu(\nu)$ contributes most to the final result. Quaero
chooses to consider the variable $\sum p_T$ in this final state.
Quaero's density estimates and choice of binning are
shown in the right panes of Fig. 9. The left panes of
Fig. 9 suggest that the data are better fit by this hypothesis
than by the Standard Model alone, a conclusion Quaero
quantifies in its returned result of $\log_{10}\mathcal{L} = 2.5$ [21].

Quaero's result $\log_{10}\mathcal{L}$ as a function of assumed $m_\chi$ is
shown in Fig. 10. Over the range shown, this hypothesis
is found to fit the data better than the Standard Model
alone, and the decimal logarithm of the likelihood ratio
is $\log_{10}\mathcal{L} \approx 2.5$. As mentioned previously, this likelihood
ratio may be interpreted in terms of an "update of betting
odds" for this hypothesis relative to the Standard Model.
If betting odds against this hypothesis are (say) $10^7$:1
before the H1 data are considered, these betting odds
should be revised in light of these data to $\approx 30000$:1
against.

FIG. 11: MadEvent input given to Quaero to specify an
excited quark signal. The symbols u and d denote the Standard
Model up quark and down quark; ux and dx denote an
excited up quark and an excited down quark. The line beginning
with PARTICLE introduces into the theory an excited
up quark with spin 1/2, mass 200 GeV, width 0.04 GeV, and
a color triplet. The eight lines beginning with INTERACTION
introduce new vertices and specify the coupling strength at
each vertex, with dmx indicating a dipole moment coupling.
An excited down quark and its interactions are similarly specified.



C. Excited quark 

If the known fermions are composite, a clear signal
would be the detection of their excited states. At HERA,
the hypothesis of an excited quark can be taken to correspond
to new terms in an interaction Lagrangian of the
form [14]



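$$\mathcal{L} = \frac{1}{2\Lambda} \, \bar{q}^{*}_{R} \, \sigma^{\mu\nu} \left[ g_s f_s \frac{\lambda^a}{2} G^a_{\mu\nu} + g f \frac{\vec{\tau}}{2} \cdot \vec{W}_{\mu\nu} + g' f' \frac{Y}{2} B_{\mu\nu} \right] q_L + \mathrm{h.c.}, \qquad (2)$$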



where $\sigma^{\mu\nu} = (i/2)[\gamma^\mu, \gamma^\nu]$; $g_s$, $g$, and $g'$ are the SU(3),
SU(2), and U(1) gauge couplings; $G^a_{\mu\nu}$, $W_{\mu\nu}$, and $B_{\mu\nu}$
are the SU(3), SU(2), and U(1) field strength tensors; $\Lambda$
is the compositeness scale; $q$ represents the first generation
quark doublet; and $q^*$ is the first generation excited
quark doublet. These terms, with a specific choice of
parameter values, can be specified with MadEvent input
shown in Fig. 11. The choice $f = f'$, $f_s = 0$, and
$m_{u^*} = m_{d^*}$ leaves just two parameters: $f/\Lambda$ and the
excited quark mass $m_{q^*}$.

Parameters specified in Fig. 11 include the mass of the
$u^*$, its coupling to its Standard Model counterpart and
photon in the first two INTERACTION lines in Fig. 11, its
coupling to its unexcited weak isospin partner and W in
the third and fourth INTERACTION lines, its coupling to
its Standard Model counterpart and Z in the fifth and
sixth INTERACTION lines, its coupling to the photon in
the seventh INTERACTION line, and its coupling to the
Z boson in the eighth and final INTERACTION line. An
excited down quark $d^*$ is introduced similarly.

FIG. 12: Plots illustrating Quaero's testing of the excited
quark scenario described in the text, with parameters $f/\Lambda = 0.01$
and $m_{q^*} = 200$ GeV. The contribution of the excited
quark signal is seen by comparing the distributions predicted
by the hypothesis $\mathcal{H}$ (light, green) to the Standard Model SM
(dark, red). H1 data are shown as (black) filled circles. The
most useful final state for distinguishing between the excited
quark hypothesis $\mathcal{H}$ and the Standard Model SM is found to
be […], followed in importance by $2e^+ j$, $e^+ j \mu^+$, and $e^+ jj$.
The most important variable chosen by Quaero in each final
state is shown; in the final state $e^+ j \mu^+$, Quaero decides it
has insufficient Monte Carlo events to adequately populate
a variable space, and simply considers the total number of
events. The number (black) at upper right is the decimal log
likelihood ratio $\log_{10}\mathcal{L}$ that would be found by Quaero if
that final state were analyzed alone, without integration over
systematic errors. Systematic errors are incorporated into the
final result returned by Quaero.

Figure 12 summarizes Quaero's analysis of this hypothesis
with the choice $f/\Lambda = 0.01$ and $m_{q^*} = 200$ GeV.
Shown are the four final states contributing most to the
final result.

Quaero's result $\log_{10}\mathcal{L}$ as a function of assumed
coupling $f/\Lambda$ and excited quark mass $m_{q^*}$ is shown in
Fig. 13. The current result from Zeus [13], which makes
use of 47.7 pb$^{-1}$ of $e^+p$ data at 300 GeV, is shown in
Fig. 14.

Quaero can also be used to test other parameter
points within the parameter space defined by $f$, $f'$, $f_s$,
$m_{u^*}$, $m_{d^*}$, and $\Lambda$. Fig. 15(a) shows the parameter plane
of $f/\Lambda$ and $m_{q^*}$ under the assumptions $m_{u^*}$ […],
$f = -f'$, and $f_s = 0$. Fig. 15(b) shows the parameter
plane of $f/\Lambda$ and $m_{u^*}$, under the assumptions
$m_{u^*} = 2m_{d^*}$, $f = f'$, and $f_s = 0$. All parameter
points considered are found to have $\log_{10}\mathcal{L} < 0$.

FIG. 13: Quaero's log likelihood ratio as a function of $f/\Lambda$
and $m_{q^*}$, under the assumptions $m_{u^*} = m_{d^*}$, $f = f'$, and
$f_s = 0$. All shaded area corresponds to $\log_{10}\mathcal{L} < 0$.

FIG. 14: A previous result from Zeus in the parameter plane
of $f/\Lambda$ and $m_{q^*}$, under the same simplifying assumptions $f = f'$,
$f_s = 0$, and $m_{u^*} = m_{d^*}$, from Ref. [13].

FIG. 15: Quaero's log likelihood ratio as a function of $f/\Lambda$
and $m_{q^*}$, under the assumptions $m_{u^*}$ […], $f = -f'$, and
$f_s = 0$ (left), and under the assumptions $m_{q^*} = m_{u^*} = 2m_{d^*}$,
$f = f'$, and $f_s = 0$ (right). All shaded area corresponds to
$\log_{10}\mathcal{L} < 0$.


VI. SUMMARY 

The histograms of the invariant masses and the sum of
transverse momenta from the high-pT events selected in
a general search for new physics at H1 have been incorporated
into Quaero, a framework for automating tests
of hypotheses against data. The resulting interface is
called Quaero@H1. New physics scenarios can be provided
to this interface in the form of commands to one
of several commonly used event generators, which evaluate
the short-distance consequences of each scenario. An
automated analysis algorithm within Quaero optimizes
selection to distinguish between the new scenario and the
Standard Model, returning a single number quantifying
the extent to which the data (dis)favor the new hypothesis
relative to the Standard Model alone, together with
plots allowing the user to understand how the analysis
has been performed. The use of Quaero@H1 has been
illustrated with searches for leptoquarks, R-parity violating
supersymmetry, and excited quarks.

[20] Consider a model with an overall cross section $\sigma$ as its
only free parameter, and a flat Bayesian prior on $\sigma$. After
performing an analysis, suppose the posterior cross
section distribution $p(\sigma)$ is a gaussian centered at zero,
with the restriction $\sigma > 0$. Then models with $\sigma$ greater
than two standard deviations away from zero are excluded
at a confidence level of 95%. These correspond
to models with $\log_{10}\mathcal{L} < \log_{10}\exp(-\frac{1}{2} 2^2) = -0.87$.
Suppose instead the posterior cross section distribution
$p(\sigma)$ is a decreasing exponential, with the restriction
$\sigma > 0$. Then models with $\sigma$ greater than three times
the decay length of the exponential are excluded at a
confidence level of 95%. These correspond to models
with $\log_{10}\mathcal{L} < \log_{10}\exp(-3) = -1.3$. In this article
$\log_{10}\mathcal{L} = -1$ is highlighted as a rough and convenient
choice for the purpose of building intuition when comparing
with previous results.

[21] Quaero's apparent insensitivity to $m_\chi$ is due to restriction
to the crude variables $\sum p_T$ and $m_{\rm all}$; allowed unrestricted
choice of kinematic variables, Quaero chooses
observables that closely approximate $m_\chi$. The model-independence
of the variables $\sum p_T$ and $m_{\rm all}$ is the reason
these variables are used in the model-independent
analysis of Ref. [1], and the reason the variable $\sum p_T$ is
used in Sleuth [15, 16, 17, 18].
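The two thresholds quoted in note [20] can be checked numerically. This is a small verification sketch: the variable and function names are hypothetical, and the tail probabilities assume the half-gaussian and exponential posteriors described in that note.

```python
import math

def log10_exp(x):
    """Decimal logarithm of exp(x)."""
    return x / math.log(10.0)

# Likelihood-ratio thresholds at the 95% exclusion boundary.
gaussian_threshold = log10_exp(-0.5 * 2.0 ** 2)   # two standard deviations
exponential_threshold = log10_exp(-3.0)           # three decay lengths

# Posterior probability beyond each boundary (both close to 5%).
gaussian_tail = math.erfc(2.0 / math.sqrt(2.0))   # half-gaussian restricted to sigma > 0
exponential_tail = math.exp(-3.0)
```

Both thresholds bracket the value $\log_{10}\mathcal{L} = -1$ used in this article as a rough stand-in for a 95% confidence level exclusion.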



